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Abstract. We extend Hamilton-Jacobi theory to Lagrange-Dirac (or implicit Lagrangian) systems, a 
generalized formulation of Lagrangian mechanics that can incorporate degenerate Lagrangians as well as 
holonomic and nonholonomic constraints. We refer to the generalized Hamilton-Jacobi equation as the 
Dirac-Hamilton-J acobi equation. For non-degenerate Lagrangian systems with nonholonomic constraints, 
the theory specializes to the recently developed nonholonomic Hamilton-Jacobi theory. We are particularly 
interested in applications to a certain class of degenerate nonholonomic Lagrangian systems with symmetries, 
which we refer to as weakly degenerate Chaplygin systems, that arise as simplified models of nonholonomic 
mechanical systems; these systems are shown to reduce to non-degenerate almost Hamiltonian systems, 
i.e., generalized Hamiltonian systems defined with non-closed two-forms. Accordingly, the Dirac-Hamilton- 
Jacobi equation reduces to a variant of the nonholonomic Hamilton-Jacobi equation associated with the 
reduced system. We illustrate through a few examples how the Dirac— Hamilton-Jacobi equation can be 
used to exactly integrate the equations of motion. 



1. Introduction 

1.1. Degenerate Lagrangian Systems and Lagrange— Dirac Systems. Degenerate Lagrangian systems 
are the motivation behind the work of Dirac jlBt IT7| [T8] on constrained systems, where degeneracy of 
Lagrangians imposes constraints on the phase space variables. The theory gives a prescription for writing 
such systems as Hamiltonian systems, and is used extensively for gauge systems and their quantization (see, 
e.g., Henneaux and Teitelboim pS]). 

Dirac's theory of constraints was geometrized by Gotay et al. (see also Gotay and Nester [32 [531 [H] 
and Kiinzle |35j ) to yield a constraint algorithm to identify the solvability condition for presymplectic systems 
and also to establish the equivalence between Lagrangian and Hamiltonian descriptions of degenerate La- 
grangian systems. The algorithm is extended by de Leon and Martin de Diego |14| to degenerate Lagrangian 
systems with nonholonomic constraints. 

On the other hand, Lagrange-Dirac (or implicit Lagrangian) systems of Yoshimura and Marsden [S51 [S3] 
provide a rather direct way of describing degenerate Lagrangian systems that do not explicitly involve 
constraint algorithms. Moreover, the Lagrange-Dirac formulation can address more general constraints, 
particularly nonholonomic constraints, by directly encoding them in terms of Dirac structures, as opposed 
to symplectic or Poisson structures. 

1.2. Hamilton Jacob! Theory for Constrained Degenerate Lagrangian Systems. The goal of this 
paper is to generalize Hamilton-Jacobi theory to Lagrange-Dirac systems. The challenge in doing so is to 
generalize the theory to simultaneously address degeneracy and nonholonomic constraints. For degenerate 
Lagrangian systems, some work has been done, built on Dirac's theory of constraints, on extending Hamilton- 
Jacobi theory (see, e.g., Henneaux and Teitelboim [26, Section 5.4] and Rothe and Scholtz [44 ) as well as 
from the geometric point of view by Cariiiena et al. i8|. For nonholonomic systems, Iglesias-Ponte et al. 
[2S] generalized the geometric Hamilton-Jacobi theorem (see Abraham and Marsden [T, Theorem 5.2.4]) 
to nonholonomic systems, which has been studied further by de Leon et al. [TB , Ohsawa and Bloch j42l . 
Carinena et al. [9], and Ohsawa et al. [43]. However, to the authors' knowledge, no work has been done that 
can deal with both degeneracy and nonholonomic constraints. 
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1.3. Applications to Degenerate Lagrangian Systems with Nonholonomic Constraints. We are 

particularly interested in applications to degenerate Lagrangian systems with nonholonomic constraints. 
Such systems arise regularly, in practice, as model reductions of multiscale systems: For example, consider 
a nonholonomic mechanical system consisting of rigid bodies, some of which are significantly lighter than 
the rest. Then, one can make an assumption that the light parts are massless for the sake of simplicity; this 
often results in a degenerate Lagrangian. While naively making a massless approximation usually leads to 
unphysical result^ a certain class of nonholonomic systems seem to allow massless approximations without 
such inconsistencies. See, for example, the modelling of a bicycle in Getz [20] and Getz and Marsden [21] 



(see also Koon and Marsden |33 and Example 3.5 of the present paper 



1.4. Outline. We first briefly review Dirac structures and Lagrange-Dirac systems in Section [2] Section [3] 
introduces a class of degenerate nonholonomic Lagrangian systems with symmetries that reduce to non- 
degenerate Lagrangian systems after symmetry reduction; we call them weakly degenerate Chaplygin systems. 
Section |4] gives Hamilton-Jacobi theory for Lagrange-Dirac systems, defining the Dirac-Hamilton-Jacobi 
equation, and shows applications to degenerate Lagrangian systems with holonomic and nonholonomic con- 
straints. We then apply the theory to weakly degenerate Chaplygin systems in Scctionjsj wc derive a formula 
that relates solutions of the Dirac-Hamilton-Jacobi equations with those of the nonholonomic Hamilton- 
Jacobi equation for the reduced weakly degenerate Chaplygin systems. Appendix discusses reduction of 
weakly degenerate Chaplygin systems by a symmetry reduction of the associated Dirac structure. 

2. Lagrange-Dirac Systems 

Lagrange-Dirac (or implicit Lagrangian) systems are a generalization of Lagrangian mechanics to systems 
with (possibly) degenerate Lagrangians and constraints. Given a configuration manifold Q, a Lagrange-Dirac 
system is defined using a generalized Dirac structure on T*Q, or more precisely a subbundle D of the Whitney 
sum TT*Q®T*T*Q. 

2.1. Dirac Structures. Let us first recall the definition of a (generalized) Dirac structure on a manifold 
M. Let M be a manifold. Given a subbundle D C TM © r*M, the subbundle C TM © T*M is defined 
as follows: 

:= {(A, a) G TM © T* M \ {a', X) + {a, A') = for any (A', a') G D} . 

Definition 2.1. A subbundle D C TM © T* M is called a generalized Dirac structure if = D. 

Note that the notion of Dirac structures, originally introduced in Courant [12], further satisfies an in- 
tegrability condition, which we have omitted as it is not compatible with our interest in nonintegrable 
(nonholonomic) constraints. Hereafter, we refer to generalized Dirac structures as simply "Dirac structures." 

2.2. Induced Dirac Structures. Here we consider the induced Dirac structure Daq C TT*Q © T*T*Q 
introduced in Yoshimura and Marsden [52]. See Dalsmo and van der Schaft [13] for more general Dirac 
structures, Bloch and Crouch [4] and van der Schaft ^49] for those defined by Kirchhoff current and voltage 
laws, and van der Schaft [5G\ for applications of Dirac structures to interconnected systems. 

Let Q be a smooth manifold, Aq C TQ a regular distribution on Q, and D, the canonical symplectic 
two-form on T*Q. Denote by the annihilator of Aq and by : TT*Q T*T*Q the flat map induced 
by O. The distribution Aq C TQ may be lifted to the distribution Aj^q on T*Q defined as 

At.q:= {Tttq)-HAq)cTT*Q, 

where ttq : T*Q ^ Q is the canonical projection and Tttq : TT*Q — > TQ is its tangent map. Denote its 
annihilator by A|,.q (zT*T*Q. 

Definition 2.2 (Yoshimura and Marsden [52l|53]; see also Dalsmo and van der Schaft \T3 ). The induced 
(generalized) Dirac structure D^^ on T*Q is defined, for each z €T*Q, as 

Daq{z) {{v., a,) G T,T*Q®T;T*Q I e At.q(z), a, - Vt\z){v,) € A^.q(z)} . 

^This setting usually gives a singular perturbation problem with the small mass being its parameter. For example, for the 
Lagrangian = ex^ /2 — x'^ /2, the Euler-Lagrange equation gives ex + x = 0; the solution corresponding to the massless 
Lagrangian Lq deviates significantly from the original solution. 
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If we choose local coordinates q — (g*) on an open subset U oi Q and denote by {q,q) — (re- 
spectively, (QjP) = {q^,Pi)), the corresponding local coordinates on TQ (respectively, T*Q), then a local 
representation for the Dirac structure is given by 

DAQ{q,p) = |((g,p,g,p), {q,p,aq,ap)) G T(^q,p)T*Q (ST^q^p)T*Q \ qe Aqiq), ap = q, aq + p e AQ(g)| . 

2.3. Lagrange— Dirac Systems. To define a Lagrange-Dirac system, it is necessary to introduce the Dirac 
differential of a Lagrangian function. Following Yoshimura and Marsden j52j . let us first introduce the 
following maps, originally due to Tulczyjew [171 HH], between the iterated tangent and cotangent bundles. 

T*TQ^ — TT*Q -^T*T*Q {q,5q,6p,p)^ \ {q,p,Sq,Sp)\-^ {q,p,-dp,Sq) ^ ^ 

Let L : TQ — > M be a Lagrangian function and let jq : T*TQ — > T*T*Q be the diffeomorphism defined as 
7Q := °Kq^ (see ([2A])). Then, the Dirac differential of L is the map TlL-.TQ-^ T*T*Q given by 

S)L = 7q o dL. 

In local coordinates. 

Definition 2.3. Let L : TQ — > E be a Lagrangian (possibly degenerate) and Ag C TQ be a given regular 
constraint distribution on the configuration manifold Q. Let 

P:=FL(AQ)cr*g 

be the image of Ag by the Legendre transformation and X be a (partial) vector field on T* Q defined at points 
of P. Then, a Lagrange-Dirac system is the triple (L, Ag, X) that satisfies, for each point z £ P d T*Q, 

(X(2),2)i(u))ei?A«(z), (2.2) 

where u S Ag such that FL(u) = z. In local coordinates, Eq. ( |2.2[ ) is written as 

dL dL 
P=-g;^{q,v), '7eAg(g), q^v, p - —{q,v) G A°Q{q), (2.3) 

which we would like to call the Lagrange-Dirac equations. 

Definition 2.4. A solution curve of a Lagrange-Dirac system {L, Aq, X) is an integral curve {q{t),p{t)), 
ti <t <t2, of X in P C T*Q. 

2.4. Lagrange Dirac Systems on the Pontryagin Bundle TQ®T*Q. We may also define a Lagrange- 
Dirac system on TQ ®T*Q as well. We will use the submanifold % of the Pontryagin bundle introduced 
in Yoshimura and Marsden [52j and the (partial) vector field X on TQ © T*(5, associated with a (partial) 
vector field X on T*Q, defined in Yoshimura and Marsden [S3]. Let us recall the definition of these two 
objects. 

Given a Lagrangian L : TQ — > M, the generalized energy, £ : TQ r*(3 — )• M, is given by 

^{q,v,p) =p-v-L{q,v). 

The submanifold X is defined as the set of stationary points of £(g, v,p) with respect to v, with v £ Ag(g). 
So, X is represented by 

X^i^iq,v,p) eTQ®T*Q\ve Ag(g), p= |^(g,t;)| (2.4) 

This submanifold can also be described as the graph of the Legendre transformation restricted to the con- 
straint distribution Ag. We can also obtain the submanifold X as follows. Let pr^g : TQ © T*Q — )■ TQ be 
the projection to the first factor and tttq '■ T*TQ — )■ TQ be the cotangent bundle projection. Consider the 
map pt'TQ ■ T*TQ TQ ®T*Q (see [52 Section 4.10]) which has the property that Wtq° PT'TQ = t^tq; 
this map is defined intrinsically to be the direct sum of tttq : T*TQ TQ and r-r-g ° Kq^ : T*TQ T*Q 
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(see Section 4.10 in [55]), where tt*q : TT*Q — > T*Q is the tangent bundle projection. Then, we can 
consider the map 

PT'TQ°dL:TQ^TQ(BT*Q, 

whose local expression is 

dL 



PT'TQ°dL{q,v) ^ yq,v,—{q,v) 
Therefore, we have 

X = pT*TQ°dL{AQ). 

Now, given a (partial) vector field X onT*Q defined at points of P, one can construct a (partial) vector 
field X on TQ(BT*Q defined at points of 3C as follows (see Section 3.8 in [53 ). For (g, v,p) G OC, X{q, v,p) is 
tangent to a curve {q{t),v{t),p{t)) in TQ(BT*Q such that (g(0), w(0),p(0)) = {q,v,p) and X{q,p) is tangent 
to the curve {q{t),p{t)) in T*Q. This (partial) vector field X is not unique; however it has the property that, 
for each x eXciTQ® T*Q, 

Tpir^,Q{X{x)) = X{pTrp,Q{x)), 

where pr^^.g : TQ (B T* Q ^ T* Q is the projection to the second factor. 

On the other hand, from the distribution Aq on Q, we can define a distribution Atqi^t'Q on TQ (BT*Q 

by 

where prg : TQ®T*Q Q. Note that Atq^t-q = {Tprrp,Q)~^{AT'Q), since prg = Trgopr^.g. Then, 
as pr^.gfi is a skew-symmetric two-form on TQ<S)T*Q, we can consider the following induced (generalized) 
Dirac structure on TQ ® T*Q: 



D- 



TQ<STH 



'Q{x)-^{{i.,&.)e T,{TQ (BT*Q)x T^iTQ ®T*Q)\ 

Vx e Atq(bT'q{x), ax - {pT::*rp.Qn)^{x){vx) € A5.Qgjj..Q(x)|, 
for X S TQ Q)T*Q. A local representation for the Dirac structure Dtq^t-q is 

DTQ(BT*Qiq,v,p) = ^{{q,v,p,q,v,p), {q,v,p,ag,ay,ap)) \ 

9 G Aq(9)> q, Ofi, = 0, + p e AQ(g)|. 

Then, we have the following result. 

Theorem 2.5. For every u e Aq, define z :— VL(u) £ P and x := Pt-tq ° dL{u) £ 3C so that prytQ(x) = z. 
Then, we have 

{X{z),DL{u)) e Da,,{^) ^ {X{x),dE{x))£DTQ(sT'Q{x). 
Proof. It is not difficult to prove that the condition {X{x),dt{x)) E T)tq^t'q{x) locally reads 

dL dL 

P^-g;^{q,v), '?eAQ(g), q^v, p - —{q,v) <E A°Q{q), 



that is, the Lagrange-Dirac equations (2.3). So, we have the equivalence. □ 

As a consequence, we obtain the following result which was obtained by Yoshimura and Marsden (see 
Theorem 3.8. in [52]). 

Corollary 2.6. // {q{t),p(t)) — ¥L(q{t), v{t)), ti < t < t2, is an integral curve of the vector field X on P, 
then pt'TQ ° dL{q{t),v{t)) is an integral curve of X on 3C. Conversely, if {q{t),v{t),p{t)) , ti < t < t2, is an 
integral curve of X on 3C, then pvrp, Q[q{t) , v(t) , p{t)) is an integral curve of X . 

Therefore, a Lagrange-Dirac system on the Pontryagin bundle is given by a triple (£, 3C, X) satisfying the 
condition 

{Xix),d8.{x)) e Dtq<st'q{x), 

for all X eX. 
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3. Degenerate Lagrangian Systems with Nonholonomic Constraints 

If one accurately models a mechanical system, then one usually obtains a non- degenerate Lagrangian, 
since the kinetic energy of the system is usually written as a positive-definite quadratic form in their velocity 
components. However, for a complex mechanical system consisting of many moving parts, one can often 
ignore the masses and/or moments of inertia of relatively light parts of the system in order to simplify the 
analysis. This turns out to be an effective way of modeling complex systems; for example, one usually models 
the strings of a puppet as massless moving parts (see, e.g., Johnson and Murphey [29^ and Murphey and 
Egerstedt [40]). With such an approximation, the Lagrangian often turns out to be degenerate, and thus the 
Euler-Lagrange or Lagrange-d'Alembert equations do not give the dynamics of the massless parts directly; 
instead, it is determined by mechanical constraints. In other words, the system may be considered as a 
hybrid of dynamics and kinematics. 

We are particularly interested in systems with degenerate Lagrangians and nonholonomic constraints, 
because they possess the two very features that Lagrange-Dirac systems can (and are designed to) incorporate 
but the standard Lagrangian or Hamiltonian formulation cannot. 

In this section, we introduce a class of mechanical systems with degenerate Lagrangians and nonholonomic 
constraints with symmetry that yield non-degenerate almost Hamiltonian system^ on the reduced space 
when symmetry reduction is performed. 

3.1. Chaplygin Systems. Let us start from the following definition of a well-known class of nonholonomic 
systems: 

Definition 3.1 (Chaplygin Systems; see, e.g., Koiller [55] and Hochgerner and Garcfa-Naranjo [57]). A 
nonholonomic system with Lagrangian L and distribution Aq is called a Chaplygin system if there exists a 
Lie group G with a free and proper action on Q, i.e., $ : G x Q — > Q or <I>g : Q — > Q for any g E G, such 
that 

(i) the Lagrangian L and the distribution Aq are invariant under the tangent lift of the G-action, i.e., 
LoT^g = L and T^g{^Q{q)) = ^qigq); 

(ii) for each q £ Q, the tangent space TqQ is the direct sum of the constraint distribution and the tangent 
space to the orbit of the group action, i.e., 

r,Q = Aq((7)®T,0„ 

where 0^ is the orbit through q of the G-action on Q, i.e., 

0, -.^ {^g{q) e Q \ g e G} . 

This setup gives rise to the principal bundle 

Tr:Q^Q/G=:Q 

and the connection 

A:TQ^Q, (3.1) 

with Q being the Lie algebra of G such that keiA — Aq, i.e., the horizontal space of A is Aq. Furthermore, 
for any q G Q and q := 7T{q) G Q, the map TqTr\^^(^q^ : AQ(g) — >■ TqQ is a linear isomorphism, and hence we 
have the horizontal lift 

h\f : TqQ Aqiq); Vq ^ (T,7r|AQ(g))~^(wq). 
We will occasionally use the following shorthand notation for horizontal lifts: 

V^q:^h\f{Vq). 

Then, any vector Wq G TqQ can be decomposed into the horizontal and vertical parts as follows: 

Wq = hoi{Wq)+ver{Wq), 

with 

hOY{Wq) = hlf (Wq), Ver{Wq) = {Aq{Wq))Q{q), 

where Wq := TqTr{Wq) and £_q G X{Q) is the infinitesimal generator of e g. 



An almost Hamiltonian system is a generalized Hamiltonian system defined with a non-closed two-form as opposed to a 
symplectic form (which is closed by definition) [21 127j . 
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is of the form 
1 



Suppose that the Lagrangian L : TQ - 

]g,iv,,v,)~V{q), (3.2) 

where g is a possibly degenerate metric on Q. We may then define the reduced Lagrangian 

L:=Lo hl"^, 

or more exphcitly, 

L:TQ 

where g is the metric on the reduced space Q induced by g as foUows: 

gq{Vq,Wq) := gq {\Aq{Vq)Mq{Wq)^ = 

and the reduced potential : Q — M is defined such that V — V °tt. 

3.2. Weakly Degenerate Chaplygin Systems. The following special class of Chaplygin systems is of 
particular interest in this paper: 

Definition 3.2 (Weakly Degenerate Chaplygin Systems). A Chaplygin system is said to be weakly degenerate 
if the Lagrangian L : TQ — >■ K is degenerate but the reduced Lagrangian L : TQ — > M is non-degenerate; 
more precisely, the metric g is degenerate on TQ but positive-definite (hence non-degenerate) when restricted 
to Aq C TQ, i.e., the triple {Q,AQ,g) defines a sub-Riemannian manifold (see, e,g., Montgomery J3T5|), and 
the induced metric ^ on Q is positive-definite and hence Riemannian. 

Remark 3.3. This is a mathematical description of the hybrid of dynamics and kinematics mentioned 
above; The dynamics is essentially dropped to the reduced configuration manifold Q :— Q/G, and the 
rest is reconstructed by the horizontal lift hi , which is the kinematic part defined by the (nonholonomic) 
constraints. 



We will look into the geometry associated with weakly degenerate Chaplygin systems in Section 5.1 



Example 3.4 (Simplified Roller Racer; see Tsakiris fiH^ and Krishnaprasad and Tsakiris and Bloch 
[3t Section 1.10]). The roller racer, shown in Fig. [l| consists of two (main and second) planar coupled rigid 
bodies, each of which has a pair of wheels attached at its center of mass. We assume that the mass of the 
second body is negligible, and hence so are its kinetic and rotational energie^ Let {x,y) be the coordinates 
of the center of mass of the main body, 9 the angle of the line passing through the center of mass measured 
from the x-axis, (j) the angle between the two bodies; di and d2 are the distances from centers of mass to the 
joint, mi and Ii the mass and inertia of the main body. 

The configuration space is Q = SE{2) x = {{x, y, 9, (f>)}, and the Lagrangian L : TQ — > M is given by 



L = -mi{i' + + -h 



fl2 



which is degenerate because of the massless approximation of the second body. 
The constraints are given by 



y 



sm f CSC ( 



i; = cos0csc0 (dicoscj) + d2)9 + d2 
Defining the constraint one-forms 

u>^ :— dx — cos 9 esc <j)[{di cos (j) + d2)d9 + d2 d(f> 
we can write the constraint distribution Ag C TQ as 

AQ = [q= (i, y, 9, 0) e rg I Lu'^iq) = 0, a = 1, 2} 



{di COS0 + d2] 
^ '■= dy — sin 9 esc <j)[{di cos (j) H 



(3.3) 

d2)d9 + d2d(t)l (3.4) 



■^We note that, in the original model in 1341 1461 . only the kinetic energy of the second body is ignored, and its rotational 
energy is taken into account; one obtains a non-degenerate Lagrangian with such an approximation. 
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z 




The Lagrange-Dirac equations (2.3) give 
V. 

X — cos Q esc (I 



miVx, Py = miVy, 
{di cos (j) + d2)6 + d2(p , 



V 



sin 9 esc < 



P0 = 0, 
(di cos (j) + d2)9 + d2(, 



(3.5) 



X = Vx:, y = Vy, U = Ve, (p = v^, 

p^ = \sm9, ■py = -\cos9, = 0, = 0, 

where A is the Lagrange multipher. 

Let G — M.'^ and consider the action of G on Q by translations on the x-y plane, i.e. 

GxQ ^Q; {{a, b), {x, y, 9, (f))) ^ {x + a,y + b, 9, <j>). 

Then, the tangent space to the group orbit is given by 

d d 



TqO{q) = span 



dx ' dy 



with q = (x, 0, (f). It is easy to check that this defines a Chaplygin system in the sense of Definition 3.1 
The quotient space is Q Q/G = {(6', (/i)}, and the horizontal lift hl'^ is 



hlf(^. 



cos y CSC ( 



[di cos< 



Hence, the reduced Lagrangian L : TQ 

1 



L ~ ^di^ cos 



d2)9 + d2 

is given by 

d2{9 



, sm a CSC ( 



CSC 



{di cos (p + d2)9 + d2(j> 



1 



(3.6) 



which is non-degenerate; hence the simplified roller racer is a weakly degenerate Chaplygin system. 

Therefore, the dynamics of the variables 9 and cj) are specified by the equations of motion, which together 
with the (nonholonomic) constraints, Eq. (3.3 1, determine the time evolution of the variables x and y. 

Example 3.5 (Bicycle; see Gctz [20 , Gctz and Marsden J21j, and Koon and Marsden [33J. Consider the 
simplified model of a bicycle shown in Fig. [2} For the sake of simplicity, the wheels are assumed to be 
massless, and the mass m of the bicycle is considered to be concentrated at a single point; however we take 
into account the moment of inertia of the steering wheel. 

The configuration space is Q = SE{2) x x = {{x, y, 9, (j), the variables x, y, 9, and ip are defined 
as in Fig. [2] and (j) tana/b; also let J(0, ip) be the moment of inertia associated with the steering action. 
The Lagrangian L : TQ — > M is given by 



L 



m 
~2 



(cos i + sin 9y + a sin 9)"^ + (sin 9 x — cos 9y + a cos — c9)'^ 

+0^ sin ip -0^ 

which is degenerate. The constraints are given by 



J(0,V) 



(j) — mga cosip, 
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Figure 2. Bicycle (taken from Koon and Marsden [33] with permission from Wang Sang Koon). 

6 = (t){cos6 X + sm9y), sinOx — cosOy = 0. 
Defining the constraint one-forms 

cj^ := (l){cos9 dx + sinO dy), lo^ := sin da; — cos 9 dy, 
we can write the constraint distribution Ag C TQ as 

Aq - {q = {x,y,9j,ij) e TQ I LJ^iq) = 0, a = 1,2} . 

Let G — M.'^ and consider the action of G on Q by translations on the x-y plane, i.e., 
GxQ ^Q; ((a, 5), {x, y, 9, ^ {x + a,y + b,9, 0, 

Then, the tangent space to the group orbit is given by 

f d d 

TqO{q) = span 



dx' dy \ ' 



with q = (x, y, 0, 0, tj}). It is easy to check that this defines a Chaplygin system in the sense of Definition 3.1 
The quotient space is Q :— Q/G ~ {(6', (/i, -0)}, and the horizontal lift hl^ is 

hlf'(0,0,V') = I %os6', ^ sin 61, ^,0,V>) . 



Hence, the reduced Lagrangian L : TQ M is given by 
m 



^ 2 



{c9 - a cos -tpipy + ' + sm V' "0 



H ^ (p —mga cos tp, 



which is non-degenerate, and so this is a weakly degenerate Chaplygin system as well. 

4. Hamilton- Jacobi Theory for Lagrange-Dirac systems 

4.1. Hamilton— Jacobi Theorem for Lagrange-Dirac systems. We now state the main theorem of 
this paper, which relates the dynamics of the Lagrange-Dirac system with what we refer to as the Dirac- 
Hamilton- Jacobi equation. 

Theorem 4.1 (Dirac-Hamilton- Jacobi Theorem). Suppose that a Lagrangian L : TQ — >■ M and a distribu- 
tion Aq C TQ are given. Define T : Q — > TQ Q) T*Q by 

T{q) :=X(g)®7(g), 

with a vector field X : Q — >■ TQ and a one-form j : Q -^T*Q, and assume that it satisfies 

Tiq)eX,foranyqeQ, (4.1) 

and 

c^tIaq = 0, i.e., d'y{v, w) = for any v,w £ Ag. (4.2) 



H-J THEORY FOR LAGRANGE-DIRAC SYSTEMS 



9 



Then, the following are equivalent: 

(i) For every integral curve c{t) ofX, i.e., for every curve c : M — > Q satisfying 

c{t) = X(c(t)), (4.3) 
the curve t ^ T o c(t) = (X© 7) °c(t) is an integral curve of the Lagrange-Dirac equations ( |2.3[ ). 
(ii) T satisfies i/ie /oZZowm^ Dirac-Hamilton-Jacobi equation; 

rf(£oT)eA^, (4.4) 
or, if Q is connected and Aq is completely nonholonomi^ 

£oT = E, (4.5) 

with a constant E. 

Proof. Let us first show tliat ^ implies Assume ^ and let c{t) be an integral curve of X, and then set 

v{t) ® p{t) := T o c{t) = (X e 7) o c{t). 



Then, clearly vit) = c{t) = X{c(t)). Also, Eq. ( |4~T| ) implies that 

dL 

«(i) e AQ(c(t)), p[t)^—{q{t),v{t)). 
So it remains to show p — dL/dq E Aq. To that end, first calculate 

Mt) = |7,°c(t) = ^{c{t))c\t) = ^{c{t))T{c{t)) 
and so, for any w € Aq, we have 

MtW = ^{cit))T{cit))w^ = ^{c{t))T{c{t))w^, (4.6) 



since Eq. (4.2 1 implies, for any v,uj £ Aq, 



—^v = — 
oq-i aq^ 



and also Eq. (4.1) gives X(g) e AQ{q). On the other hand, 

d(£oT) =d(7,(g) T{q)-L{q,X(q))) 

^X'+7.^-^-^^|d,^ 
dq^ * dq^ dq^ dv^ dq^ 

P^T-^) dq^, 
oqi oq^ J 



where we used the following relation that follows from Eq. (4.1): 

7(<z) = |^(<Z,X(g)). 



So the Dirac-Hamilton-Jacobi equation ( |4.4[ ) with Eq. (4.6) implies 

d(£oT)(c(t)) . = (^^^{c{t))T{c{t)) - ^{c{t),v{t)) ] 

P,{t)-^{cit),vit))^w^ ^0. 



Since w € Aq is arbitrary, this implies 

dL 
dq 

Therefore, (p| is satisfied. 



m-^{c{t),v{t))eA°Q. 



'^A distribution Aq C TQ is said to be completely nonholonomic (or bracket-generating) if Aq along with all of its iterated Lie 
brackets [Aq, Aq], [Aq, [Aq, Aq]], . . . spans the tangent bundle TQ. See, e.g., Vershik and Gershkovich ,51 and Montgomery 
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Conversely, assume (|i]); let c{t) be a curve in Q that satisfies Eq. (4.3) and set v{t) (B p{t) := Toc(t) = 
(X©7)oc(i). Then, by assumption, {c{t),v{t),p{t)) is an integral curve of the Lagrange-Dirac system ( |2.2[ ), 
and so 

Following the same calculations as above we have, for any w G Ag, 



For an arbitrary point q e Q, we can consider an integral curve c{t) of X such that c(0) = q. Therefore, 
the above equation implies that d(£ o T)(q) ■ Wq = for any q € Q and G Aq((7), which gives the Dirac- 
Hamilton-Jacobi equation ( |4.4[ ). If Q is connected and Ag is completely nonholonomic, then by the same 
argument as in the proof of Theorem 3.1 in Ohsawa and Bloch J42], d(£ ° T) G Ag reduces to £ o T = £' for 
some constant E. □ 

Theorem 14. II can be recast in the context of Section [Z4l as follows: 



Corollary 4.2. Under the same conditions as in Theorem 4-1 the following are equivalent: 
(i) For every curve c{t) such that 

c{t) = TprQ-X(Toc{t)), 

the curve t i— > Toc(t) is an integral curve of X , and so it is an integral curve of the Lagrange-Dirac 
equations ( |2.3[ ). 

(a) T satisfies (0, (i(£ o T o prg)) e Dtq(st*q, or equivalently, (i(£oToprg) e A^g^^.g. 



Proof. The equivalence of (pl with that of Theorem 4.1 follows from the relation Tprg o X o T = X, which is 



easily checked by coordinate calculations. 

On the other hand, for first observe that prQ(AQ) = A^q^^.q. Then, since prg : TQ ®T*Q ^ Q 
is a surjective submersion, it follows that 

d(£oT)eA^ ^ pr^d(£oT)epr^(A^) ^ rf(£ o T o pr^) e A^^g^T-g- 

This proves the equivalence of ^ with that of Theorem 4.1 □ 



4.2. Nonholonomic Hamilton— Jacobi Theory as a Special Case. Let us show that the nonholonomic 
Hamilton- Jacobi equation of Iglesias-Ponte et al. [28 and Ohsawa and Bloch |32 follows as a special case 
of the above theorem. Consider the special case where the Lagrangian L : TQ ^ M is non-degenerate, i.e., 
the Legendre transformation FL : TQ — >■ T*Q is invertible. Then, we may rewrite the definition of the 
submanifold 3C C © T*Q, Eq. by 

3C = K ® e TQ ® r*Q I e Ag(<z), - FL(t-,)} 

= {Vq®Pq e TQ®T*Q I pq e Pq, Vq = {¥L)-\pq)} 

= Aq(BP, 

where we recall that P FL(Ag). It implies that if T = X ® 7 then X = (Fi)-i 07, and thus 

£oT(q) - (7(q),(FL)-i(7(g))) -Lo(FL)-i(7(9)) = ifo7('?), 
with 7 taking values in P and the Hamiltonian H : T* Q ^ R defined by 

H{q,p) := {pq, {¥Ly\pq)) - Lo (FL)-i(p,). 
Then, the Lagrange-Dirac equations (2.3l become the nonholonomic Hamilton's equations 

OH on 



or, in an intrinsic form, 
for a vector field X^j^ on T*Q. Furthermore, it is straightforward to show that 



~dHe A^.Q, TngiX]}') G Ag 



{WPy^ =¥H = Tttq°Xh, 
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where Xh is the Hamihonian vector field of the unconstrained system with the same Hamiltonian, i.e., 
ixn^ ^ hence we obtain 

Therefore, Theorem |4.1| speciahzes to the nonholonomic Hamilton- Jacobi theorem of Iglesias-Ponte et al. 
pg] and Ohsawa and Bloch [42 : 

Corollary 4.3 (Nonholonomic Hamilton-Jacobi [281 142|). Consider a nonholonomic system defined on a 
configuration manifold Q with a Lagrangian of the form Eq. ( |3.2[ ) and a nonholonomic constraint distribution 
Aq C TQ. Let J : Q T* Q be a one- form that satisfies 

7(a) G Pq for any q G Q, 

and 

c^tIaq = 0, i.e., d'yiv, w) ~ Q for any v,w £ Aq. 
Then, the following are equivalent: 

(i) For every curve c{t) in Q satisfying 

m^TiTQ-XHij^cit)), 

the curve t i~> 'y°c{t) is an integral curve of X^, where Xh is the Hamiltonian vector field of the 
unconstrained system with the same Hamiltonian, i.e., ixH^ ~ ■ 

(a) The one-form 7 satisfies the nonholonomic Hamilton-Jacobi equation.' 

diHo^)eA°Q, 

or, if Q is connected and Aq is completely nonholonomic, 

Hoj = E, 

with a constant E. 

4.3. Applications to Degenerate Lagrangian System with Holonomic Constraints. If the con- 
straints are holonomic, then the distribution Aq C TQ is integrablc, and so there exists a local submanifold 
S C Q such that TgS — Aq{s) for any s E S. Let Lg : S ^ Q he the inclusion. Then, the Dirac-Hamilton- 



Jacobi equation (4.4 1 gives 

4d(£oT) e {TSy = 0, 

and thus 

which implies that we have 

£oTo(,s = £;, (4.7) 

with a constant E. 



On the other hand, the condition (4.2) becomes 

4d7 = rf(7°i5) = 0, (4.8) 
and so 7 o is = dW for some function W defined locally on S. 

Example 4.4 (LC circuit; see Yoshimura and Marsden [52l[54]). Consider the LC circuit shown in Fig. |3] 
The configuration space is the 4-dimensional vector space Q = {(^f, ^ci: 902: ^ca)}: which represents charges 

i 

I — W i 1 



C2 



Cl 



C3 



Figure 3. LC circuit (see Yoshimura and Marsden [S2]). 



12 



M. LEOK, T. OHSAWA, AND D. SOSA 



in the circuit elements. Then TQ ^ Q x Q and fq — {fi, fa, fc2j fcs) G TqQ represents the currents in the 
corresponding circuit elements. The Lagrangian L : TQ — > M is given by 



2 I C\ 2 C2 2 C3 



which is clearly degenerate. 

The generalized energy £ : TQ ® T*Q ^ M is 



^Pih+Pcjc, +PcJc. +PcJcs -Uf! + 1^ + 1^ + 1^. 

2 2 Ci 2 C2 2 

The Kirchhoff current law gives the constraints —fe + fc2 — ^ £^nd fc^ — /c^ + /c., = 0, or in terms of 
constraint one-forms, — —dqi + dqc^ and lo^ — dq^^ — dq^^ + dq^,^. Thus, the constraint distribution 
Aq C TQ is given by 

AQ = {/erg|w'^(/)=0, a = l,2}. 
So the submanifold %^TQ®T*Q is 

3^ - {(«, I.P) ^TQ® T*Q I /, = = + /,3, p, ^£f,, p,^ = p,, = = 0} . 

Hence, the generalized energy constrained to 3C is 

2 2 Ci 2 C2 2 

Notice that the constraints are holonomic, i.e., the constraints can be integrated to give 

qe - qc2 = «o, qc2 - qci - qca = ai, 

with some constants and ai. So we define a submanifold C Q by 

S {{qi,qc,,qc2,qc3) ^ Q \ qc2 qi - ao, qc;, = qc2 - 9ci - aj = {{qe,qc,)}, 
and the inclusion 

Ls ■■ S ^ Q; {qi,qci) ^^ {qe,qci,q£ - ao,qc2 ~ qct ~ a-i)- 
Now, the Dirac-Hamilton-Jacobi equation for holonomic systems, Eq. (4.7), gives 

£°Tot5 = £;, (4.9) 

with some constant E, where T o ls : S TQ ®T*Q is 

^ °i's{qt,qci) = (|(?f,gci,i(g£,gci),7('?f,'7ci)) 

with X := X o ig : S* — > TQ and j '■— j ° is '■ S T* Q given by 

^(^^^ci) = (x£(gi.,gci),Xci(g£,gci),5;c2((7£,gci),Xc3(g£,gci)) , 
i{qi,qct) = (7£(g<;, gci),7ci(g<;,gci),7c2 <?ci ), 7c3 <?ci )) ■ 

The condition T o Ls{qi, qci) G 3C implies 

Xf = Xc2, Xc2 = Xci + Xc3, 7£ = £Xi, = 702 = 7c3 = 0- 

Then, 

7 = 7°'-S = e^e{qi,qcjdqi, 

and thus condition (4.8 1 gives 



dqci 



0, 



and hence Xg{qe,qc-^) — Xf(qe). The Dirac-Hamilton-Jacobi equation (4.9) then becomes 
2 2 ci 2 C2 2 C3 
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We impose the condition that = when qi — ^ Q and E = which corresponds to the case where 
nothing is happening in the circuit. Then, we have 



C2 



(qq + Oi)^ 
C3 



0, 



which gives ag = ai = 0, since C2 and C3 are both positive. Therefore, Eq. (4.10) becomes 

Iql^ Iqj l{qi-qc,f 



-iXfiqe)'^ -r r - — 

2 ' 2 ci 2 C2 



E. 



2 C3 

Taking the derivative with respect to qc^ of both sides and solving for , we have 

ci 



(4.11) 



Substituting this into Eq. (4.111 gives 

1 
2 



ci + C3 



n.^ ^ ^2 Cl +C2 +C3 2 1 p 

IXiiqi) + — -q/, ^E. 

C2(Cl+C3) 



Solving for ^^(qf), we obtain 



C2(C1 + C3) 



Taking the positive root, Eq. (4.31 for qe gives 



qe = 



2E- 



C1+C2 + C3 

C2(C1 + C3) 



which can be solved easily: 



where 



2E 



2 sm{iyt + a), 



C1+C2 + C3 



C2(C1 +C3)i 

and a is a phase constant to be determined by the initial condition. 

Remark 4.5. In the conventional LC circuit theory, one often simplifies problems by "combining" capacitors. 
Using this technique, the above example simplifies to an LC circuit with an inductor with inductance £ and 
a single capacitance C, that satisfies the following equation: 

1 1 1 



which gives 



C C2 Ci + C3 ' 

^ ^ C2(C1 +C3) 

Ci + C2 + C3 ■ 

Then, the equation for the current ii q£ is given by 



df^ C 



= 0, 



with 



df^ 



1 



+ i^ie = 0, 



C1+C2 + C3 



^ic M 02(01 + 03) i 

which coincides the one defined above. The general solution of the above ODE is 

(t) — qi (t) ~ Asin{i^t + a) 
for some constants A and a. Therefore, our solution is consistent with the conventional theory. 
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4.4. Applications to Degenerate Lagrangian System with Nonholonomic Constraints. 



Example 4.6 (Simplified Roller Racer; see Example 3.4). The submanifold X C TQ © T*Q is given by 
3C = v,p) e TQ Q)T*Q \ Vx = cos 9 CSC (j)[{di cos (f> + ve + d2t'0], 

Vy = sin6'csc(/)[(di cos(/) + ^2) ve + ^2"^], = miVx, Py = miUj,, = /lUg, = o|, 
and the generalized energy constrained to OC is 

£|3C = csc^ [(di COS0 + c?2) we + o?2W0]^ + ^^i^e- 

The distribution Ag is easily shown to be completely nonholonomic, and thus we may use the Dirac- 
Hamilton-Jacobi equation (4.5), which gives 



^TOiCSC^0[(di cos + ^2 )Xe(g) +^2X0(17)]^ + ^IiXg{q)'^ = E. 



Now, we assume the following ansat^ 



(4.12) 



(4.13) 



However, substituting them into Eq. (4.12) and solving for Xg shows that Xg does not depend on 9 either; 
hence we set Xg(9, c 



Xg{(/)). Then, solving Eq. (4.12) for X^, we have 

-{di COS0 + d2)Xg{(j)) ± sm(t)J2E - IiXg{(j)Y 

X0(0) = 



(4.14) 



Substituting the first solution into condition (4.2), we have 

(di cos0 + d2)Xe(</>) - sin0v/2£; - /iXe(^)2l X'gicp) = 0. 
We choose Xg((/)) = and hence 

Xg{4>) = Vg, 

where vg is the initial angular velocity in the 0-dircction. This is consistent with the Lagrange-Dirac 
equations (3.5), which give 9 — 0. Substituting this into the first case of Eq. (4.14), we obtain 

X0(0) 



-«e 1 + T cos ( 

0.2 



where Vr ■= \/ {^E — Iivf)/mi. 



Then, the condition X{q) e Aq((7) gives the other components of the vector field X, and hence Eq. (4.3) 
gives 

X — Vj, cos 9, y = Vr sin 61, 



-0, 



-vg 1 



di 



COS ( 



smc 



We can solve the last equation by separation of variables, and the rest is explicitly solvable. 

4.5. Lagrangians that are Linear in Velocity. There are some physical systems, such as point vor- 
tices (see, e.g.. Chapman [11] and Newton [41|), which are described by Lagrangians that are linear in 
velocity, i.e., 

L{q,q)^{a{q),q)-h{q), (4.15) 
where a is a one-form on Q. The Lagrangian is clearly degenerate and Lagrange-Dirac equation (2.3[) gives 



the following equations of motion (see Rowley and Marsden [45] and Yoshimura and Marsden [54 ) : 

- ixda = dh, (4.16) 



where X is a vector field on Q; hence the Lagrange-Dirac equation (2.3) reduces to the first-order dynamics 
q = X(q) defined on Q. 



^The (x, j/)-dependence is eliminated because we expect that the vector field X to be R'^-translational invariant since the 
system possesses R^-symmetry. 
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Now, the assumption in (4.1 ) of Theorem 4.1 implies 7(9) = o:(q) and thus 



£oT(g) = /i(g); 



so the Dirac-Hamilton-Jacobi equation (4.4) gives 

Hq) = E, 

which simply defines a constant energy surface of the dynamics on Q, i.e., the Dirac-Hamilton-Jacobi 



equation (4.51 does not give any information on the dynamics on Q. This is because the original dynamics, 



which is naturally defined on Q with the one-form a and the function h, is somewhat artificially lifted to 



the tangent bundle TQ through the linear Lagrangian (4.151. In fact, for point vortices on the plane, one 



has Q — , and the two- form —da is a symplectic form; hence Q = is a symplectic manifold and 



Eq. (4.16) defines a Hamiltonian system on Q with the Hamiltonian h. Therefore, for such systems, the 



Hamilton- Jacobi equation should be naturally formulated on Q, not on the Pontryagin bundle TQ T*Q. 

5. Hamilton- Jacobi Theory for Weakly Degenerate Chaplygin Systems 

In this section, we first show that a weakly Chaplygin system introduced in Section |3.2| reduces to an 
almost Hamiltonian system on T*Q with a reduced Hamiltonian H : T*Q 'R. Accordingly, we may 
consider a variant of the nonholonomic Hamilton-Jacobi equation for the reduced system, which we 

would like to call the reduced Dirac-Hamilton-Jacobi equation. We then show an explicit formula that maps 
solutions of the reduced Dirac-Hamilton-Jacobi equation to those of the original one. Thus, one may solve 
the reduced Dirac-Hamilton-Jacobi equation, which is simpler than the original one, and then construct 
solutions of the original Dirac-Hamilton-Jacobi equation by the formula. 

5.1. The Geometry of Weakly Degenerate Chaplygin Systems. For weakly degenerate Chaplygin 
systems, the geometric structure introduced in Sect ion [3.1 [ is carried over to the Hamiltonian side. Specifically, 
we define the horizontal lift hl^ : T|(5 — > by (see Ehlers et al. [IS]) 

hlf :=FL,ohlf o(FL),i, 
or by requiring that the diagram below commutes. 

A A 
I _ 

(FL)ri 

It is easy to show that the following equality holds for the pairing between the two horizontal lifts (see 
Lemma A.l in Ohsawa et al. [43)): For any £ T^Q and Vq E TqQ, 

(m^ia,)Mfiv,)) = {a„v,). (5.1) 

We also define a map hl^ : r|Q ^ OC, C T^Q T*Q by 

hlf := (hlf o(FZ)ri)ehlf . 

Since the reduced Lagrangian L is non-degenerate, we can also define the reduced Hamiltoniarj^ H : 
T*Q M as follows: 

Hip,):^{p,,v,)-Liv,), (5.2) 

with Vq = {¥L)g\pq). 

Lemma 5.1. The generalized energy £ : TQ (B T*Q — > M and the reduced Hamiltonian H are related as 
follows: 

£ o hF = H. 



^Recall that we cannot define a Hamiltonian H : T*Q —^ M for the original system because the original Lagrangian 
L : TQ — >■ R is degenerate. 
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Proof. Follows from the following simple calculation: For an arbitrary aq G Q , let q E Q he a. point such 
that 7r(g) = q. Then, we obtain 



£ o hlf (a,) = (hlf ia,),h\f ° (FZ),-^(a,)) - L o hlf o (¥L)^\a,) 
= (a„ (FL)ri(a,)) - Lo (FL)ri(a,) 



where we used Eq. (5.1) and the definition of H in Eq. (5.2 1 



□ 



Furthermore, as shown in Theorem A. 4 of the Appendix [A| (see also Koiller [32], Bates and Sniatycki [5], 
Cantrijn et al. [7], Hochgerner and Garci'a-Naranjo [57]), we have the reduced system 



on T*Q defined with the reduced Hamiltonian H and the almost symplectic form 



(5.3) 
(5.4) 



where ^ is the non-closed two-form on T*Q defined in Eq. (A. 9 1 



5.2. Hamilton— Jacobi Theorem for Weakly Degenerate Chaplygin Systems. The previous subsec- 
tion showed that a weakly degenerate Chaplygin system reduces to a non-degenerate Lagrangian and hence 



an almost Hamiltonian system (5.3). Moreover, Lemma 5.1 shows how the generalized energy £ is related 



to the reduced Hamiltonian H; see also the upper half of the diagram (5.5) below. The lower half of the di 



agram suggests the relationship between the reduced and original Dirac-Hamilton-Jacobi equations alluded 
above. Specifically, 7 is a one-form on Q := Q/G and is a solution of the reduced Dirac-Hamilton-Jacobi 
equation (5.6) defined below, and the diagram suggests how to define the map T : Q — >■ 3C so that it is a 



solution of the original Dirac-Hamilton-Jacobi equation (4.4). 




(5.5) 



The whole diagram (5.5) leads us to the following main result of this section: 



Theorem 5.2 (Reduced Dirac-Hamilton-Jacobi Equation). Consider a weakly degenerate Chaplygin system 
on a connected configuration space Q and assume that the distribution Aq is completely nonholonomic. Let 
J : Q T* Q be a one-form on Q that satisfies the reduced Dirac-Hamilton-Jacobi equation 



with a constant E, as well as 



H°l{q)=E, 



d^ + j*E = 0, 



(5.6) 
(5.7) 



where ^ is the two-form on T*Q that appeared in the definition of the almost symplectic form f2" in Eq. (5.4 ) 



(see also Eq. {A.9\). Define T = X®7:Q->3C by (see the diagram {5.5)) 



T{q) :=he 070^(5) ^hlf (7(g)), 



(5.8) 



where q :— 7r(g), i.e., 



Xiq) := he ° (FZ)rl(7(g-)), 7(9) := hl^jiq)) 



Then, T = X® 7 satisfies the Dirac-Hamilton-Jacobi equation (4.5) as well as condition (4.2) 
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Proof. This proof is very similar to that of Theorem 4.1 in Ohsawa et al. 



The diagram (5.5) shows that if the one-form 7 satisfies Eq. (5.6 1 then the map T defined by Eq. (5 



satisfies the Dirac-Hamilton-Jacobi equation (4.5) 



To show that it also satisfies the condition (4.2 ), we perform the following calculations: Let F'^ e X{Q) 
be arbitrary horizontal vector fields, i.e., Y^, e Aq((7) for any q € Q. We start from the following identity: 

d7(r^ Z^) = y^[7(Z'^)] - Z'^ijiY"^)] - j{[Y\ Zh]). (5.9) 

The goal is to show that the right-hand side vanishes. Let us first evaluate the first two terms on the right- 
hand side of the above identity at an arbitrary point q G Q: Let Zg :— TqTTQ{Zg) G TgQ, then Z^ = h\^{Zq). 
Thus, we have 



7(Zh)(<z) = (hlf o7(g-),hlf(Z,; 
= l{Z){q)- 

Hence, writing jz = for short, we have 7(Z'^) = 7^0 tt. Therefore, defining Yq := TqTT{Yq^), i.e., 

Yq^=hlf{Yq), 

Yh[7(Z'^)](<z) = r'^[7z°^](g) 

= (d(7zo^)^,rh) 

= {in*djzh,Yq^) 
= {d^z{q),Tq^{Y^)) 
= {djziq),Yq) 

= Ybzm 

= Y[-,{Z)]iq). 

Hence, we have 

Y^'hiZ^)] - Z^MY'^)] = Y[^{Z)] Z[7(y)], (5.10) 

where we have omitted q and q for simplicity. 

Now, let us evaluate the last term on the right-hand side of Eq. (5.9): First we would like to decompose 
[y'^,Z'^]g into the horizontal and vertical parts. Since both Y^ and Z^ are horizontal, we haveQ 



whereas the vertical part is 



hor([r'\z'>],) = hif([r,z],), 

veTi[Y\z\) = {Aqi[Y\Z%)) (q) = (55,(y,^ Z,^)) (g), 



where we used the following relation between the connection A and its curvature ® that holds for horizontal 
vector fields Y^ and Z^: 

'BqiYq\Z^)^dAqiYq^,Z^) 

= Y'^[A{Z'^)]{q) - Z'^[A{Y'^)]{q)-A{[Y'\Z'^]){q) 
= -Ai[Y\Z'^])iq). 
As a result, we have the decomposition 



Therefore, 



[Y\ z\ = hif ([y, z]q) (s,(y,^ z^q))^ (q). 

^i{Y\Z^])iq) = (hlf o7o^(g),hlf ([y,Z],)) - (hlf 070^9), {'Bq{Yq\Z^))^ (q)) 
= (7(g), [Y,Z]q) (j (hlf o7(g-)) ,^q{Y^\Z'^)) 

^nY,Z]){q)-r^iY,Z){q), (5.11) 



^See, e.g., Kobayashi and Nomizu |31i Proposition 1.3 (3), p. 65]. 
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where the second equahty follows from Eq. (5.1 ) and the definition of the momentum map J; the last equality 
follows from the definition of S in Eq. (A.9): Let ttq : T*Q ^ Q he the cotangent bundle projection; since 
ttq o ^ = idg and thus Tttq ° T7 = id^Q, we have 

rS{Y,Z){q) - {Tj{Y,),Tj{Z,)) 



Jo hi 



Substituting Eqs. (5.10) and (5.11) into Eq. (5.9), we obtain 



djiv^z"^) = YMZ)] - z[j{Y)] - 7([y,z])(q-) + 7*s(r,z) 

^d^(Y,Z)+r^{Y,Z) 
= (d7 + rS) {Y,Z)=0. 



□ 



Example 5.3 (Simplified Roller Racer; see Examples 3.4 and 4.6 1. The Lie algebra g of G = is identified 
with R-^; let us use as the coordinates for g. Then, we may write the connection A : TQ — > g as 

T d 9 d 
or] 



where uj^ and are the constraint one-forms defined in Eq. (3.4); hence its curvature is given by 

53 = - csc^ (j)[di cos 6* + ^2 cos(6' + (j))]d6 A d0 - csc^ 4>[di sin6' + d2 sm{6 + (j))]d0 Ad4><^ d-q. 
Furthermore, the momentum map J : T*(5 — >■ g* is given by 



Therefore, we have 



J(P<}) ^Px d^+pydrj. 
'di 



-P4> 



do 



+ cos (j) CSC (f)d9 A d(j). 



Since the reduced Lagrangian L (see Eq. (3.6|) is non-degenerate, we have the reduced Hamiltonian 
H : T*Q R given by 



H 



1 

2h 



We assume the ansatz 



di . 

1 + — cos ) P0 



I4,{e,(f) = 70 («!)). 



2 sin2, 
-I- 



Then, the reduced Dirac-Hamilton-Jacobi equation (5.6) gives 



1 
2iT 



di 



1 + ^cos(/) ) 70(0) 



which implies that jg{9,4') = ^g{<j)). Solving this for 70(0) gives 



2 • 2 
sm 



2mid: 



2 740)2 = E, 



1 -I- — cos ( 

02 



70('^)±lMi 2i? 



mid\ 



Substituting the first case into Eq. (5.7), we obtain 

= - cot 07^(0), 

which gives 

70(0) = Ccsc0 

for some constant C. Therefore, 



7e(0) = C ( 1 + ^ COS0 ) csc0-h J/i ( 2E 



(J2 



It is straightforward to check that, with the choice 



C = d2Jmi{2E~hvl), 



Eq. (5.8) gives the solution obtained in Example 4.6 
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Remark 5.4. Notice that the ansatz we used here is less elaborate compared to the one, Eq. (4.13), used for 
the Dirac-Hamilton-Jacobi equation without the reduction. Specifically, accounting for the M ■^-symmetry is 
not necessary here, since the reduced Dirac-Hamilton-Jacobi equation is defined for the M^-reduced system. 



6. Conclusion and Future Work 

Conclusion. We developed Hamilton-Jacobi theory for degenerate Lagrangian systems with holonomic 
and nonholonomic constraints. In particular, we illustrated, through a few examples, that solutions of 
the Dirac-Hamilton-Jacobi equation can be used to obtain exact solutions of the equations of motion. 
Also, motivated by those degenerate Lagrangian systems that appear as simplified models of nonholonomic 
mechanical systems, we defined a class of degenerate nonholonomic Lagrangian systems that reduce to non- 
degenerate almost Hamiltonian systems. We then showed that the Dirac-Hamilton-Jacobi equation reduces 
to the nonholonomic Hamilton-Jacobi equation for the reduced non-degenerate system. 

Future Work. 

• Relationship with discrete variational Dirac mechanics. Hamilton-Jacobi theory has been an im- 
portant ingredient in discrete mechanics and symplectic integrators from both the theoretical and 
implementation points of view (see Marsden and West [38, Sections 1.7, 1.8, 4.7, and 4.8] and Chan- 
nell and Scovel [ID]). It is interesting to see if the Dirac-Hamilton-Jacobi equation plays the same 
role in discrete variational Dirac mechanics of Leok and Ohsawa [37] . 

• Hamilton-Jacobi theory for systems with Lagrangians linear in velocity. As briefly mentioned in Sec- 
tion [43j the Dirac-Hamilton-Jacobi equation is not appropriate for those systems with Lagrangians 
that are linear in velocity. However, Rothe and Scholtz ^44^ Example 4] illustrate that their for- 
mulation of the Hamilton-Jacobi equation can be applied to such systems. We are interested in 
a possible generalization of our formulation to deal with such systems, and also a link with their 
formulation. 

• Asymptotic analysis of massless approximation. Massless approximations for some nonholonomic 
systems seem to give good approximations to the full formulation. It seems that the nonholonomic 
constraints "regularize" the otherwise singular perturbation problem, and hence makes the mass- 
less approximations viable. We expect that asymptotic analysis will reveal how the perturbation 
problem becomes regular, particularly for those cases where massless approximations lead to weakly 
degenerate Chaplygin systems. 



Hamilton-Jacobi theory for general systems on the Pontryagin bundle. Section 2.4 naturally leads 
us to consider systems on the Pontryagin bundle described by an arbitrary Dirac structure. We are 
interested in this generalization, its corresponding Hamilton-Jacobi theory, and its applications. 
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Appendix A. Reduction of Weakly Degenerate Chaplygin Systems 

A.l. Constrained Dirac Structure. We may restrict the Dirac structure -Daq to P C T*Q as follows (see 
Yoshimura and Marsden [321 Section 5.6] and references therein): Let us define a distribution J£ C TP on 
Pby 

K-.^TPnAr^Q, (A.l) 

and also, using the inclusion Lp : P ^ T*Q, define the two-form flp tpO on P. Then, define the 
constrained Dirac structure Dp C TP T*P, for each z € P, by 

Dp{z) := [{v,,a,) e T,P®T*P \ £ 5£„ a, - nUz){vz) e Jf^} , 
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where fip : TP — > T* P is the flat map induced by ilp. Then, we have the constrained Lagrange-Dirac 
system defined by 

{Xp,T)L,) e Dp, (A.2) 

where Xp is a vector field on P, Lc '■— ^Iaq the constrained Lagrangian, and SLc(u) 'DL{u)\tp for any 
u e Aq. 

If the constrained Lagrangian is non-degenerate, i.e., the partial Legendre transformation FL|aq : 
Aq ^ P is invertible, then we may define the constrained Hamiltonian Hp : P — >■ M [S3] by 



HpiPq) (Pq 



Lc{Vq), 



where v„ 



(FP|aq) ^{Pq)- Then, the constrained Lagrange-Dirac system (A.2), is equivalent to the 
constrained implicit Hamiltonian system defined by 

{Xp,dHp)eDp. (A.3) 

Remark A.l. Let 

:= fipk (A.4) 
be the restriction of Hp to J{ C TP and hence a skew-symmetric bilinear form in J{. If fl^ is non-degenerate. 



then Eq. (A.3) gives 



ixp^:K — dHp\^, 

which is nonholonomic Hamilton's equations of Bates and Sniatycki [2] (see also Koon and Marsden [33]). 

A.2. Reduction of Constrained Dirac Structure. Let us now show how to reduce the constrained Dirac 
structure Dp to a Dirac structure on T*Q. This special case of Dirac reduction to follow gives a Dirac point of 
view on the nonholonomic reduction of Koiller |32) , and hence provides a natural framework for the reduction 
of weakly degenerate Chaplygin systems. See Yoshimura and Marsden [5S] for reduction of Dirac structures 
without constraints, Jotz and Ratiu j30j for the relationship between Dirac and nonholonomic reduction of 
Bates and Sniatycki [2]; see also Cantrijn et al. [6l[7] for a theory of reducing degenerate Lagrangian systems 
to non-degenerate ones. 

Let $ : G X Q — >• Q be the action of the Lie group G given in Definition 13.1] and T*^g-i : T*Q ^ T*Q 
be its cotangent lift defined by 

{T*<^g-i{a),v)^{a,T<^g-i{v)). 

It is easy to show that the G-symmetries of the Lagrangian L and the distribution Ag imply that the 
submanifold P G T* Q is invariant under the action of the cotangent lift. Hence, we may restrict the action 

P ^ P hy -.^ T*^g-i\p for any g e G. This gives rise to 



P, i.e. 



9 



to P and define : G x P 
the principal bundle 

7rg : P P/G. 

The geometric structure of weakly degenerate Chaplygin systems summarized in Section |5.1| gives rise to 
a diffeomorphism : T*Q — > P/G; this then induces the map p : P ^ T*Q so that the diagram below 
commutes (see Hochgerner and Garcfa-Naranjo [27)). 



(A.5) 




Furthermore, the principal connection A : TQ — )• g defined in Eq. (3.1) induces the principal connection 
Ap -.TP ^ g defined by 

Ap := A° Tttq °Tlp, 



and the horizontal space for this principal connection is J{ C TP defined in Eq. ( A.l ), i.e., J{ ~ 'kexAp 1271 . 
Therefore, writing [z] :— ttq{z) E P/G, we have the horizontal lift 



hi 



^ • r[.](P/G) ^ M,; ^ (P.^^kJ-H%])- 
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Then, clearly the following diagram commutes: 




(A.6) 

T^4P/G) ^T,T*Q 

where z ip^^{[z\) G T*Q. 

Lemma A. 2. The two-form U,p is invariant under the G-action, i.e., for any g € G, 

{<s>^rnp = np. (A.7) 

Proof. Using the relation T*^g-i ° Lp — Lpo we have 

= {Lpo^pyn 

= (tp)*o(T*<i><,-i)*f7 
= i*pQ. 

^np, 

where we used the fact that the cotangent lift T*^g-i is symplectic. □ 
Now, consider the action of G on the Whitney sum TP (BT*P defined by 

^ ■.Gx{TP(BT*P)^TP(ST*P; (5 a, )) (t, T;, i («,))=: (g • g • a,) . 

Then, we have the following: 

Proposition A. 3. The constrained Dirac structure Dp is invariant under the action ^ defined above. 

Proof. Let z e P be arbitrary and {vz,az) G Dp{z). Then, Vz & and Uz — i^p{vz) G 3^°- Now, the 

l>g{vz) e 'Kgz. Also, for any Wgz = Tz<t>^{wz) G ^gz 



G-invariance of J{ = keryip implies T$g(wz) G Jigz. Also, for any Wgz — Tz^g{wz) G Oigz with Wz G Oiz, 



we have 



T;,$;_,(a,) - n'p {Tz<i>^{vz)) .Wgz) = (az.Tgz<^^g-.{wgz)) - Op (r,*^ (t;,), T,$f K)) 

= (a„ Wz) - np {Tz<^>^{vz), Tz<^>^{wz)) 

= {az,Wz) - i'^'g Y^P {Vz,Wz) 
= (az.Wz) - ^Ip {Vz,Wz) 

= (^az - Q^p{vz),Wz) 
= 0, 



where the fourth line follows from Eq. (A.7). Hence 



{g ■ Vz,g ■ az) ^ (Tz<S>^{vz),T;,<^P^^{az)) e Dp{gz), 

and thus the claim follows. □ 

Now, the main result in this section is the following: 

Theorem A. 4. The reduced constrained Dirac structure [Dp]q := Dp/G is identified with the Dirac struc- 
ture D on T*Q defined, for any z £T*Q, by 

D(z) := 



(z) := {{vz,az) G TzT*Q © T|T*Q | a, = (O"'')^^-)} , (A., 
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where fi"'^ = £7 — 5 with ft being the standard symplectic form on T*Q, and the two-form ^ on T*Q is 
defined as follows: For any o.q ^ Tq Q and ^a^? ^ 

T*Q, let Yq := TTrg^^a-) and Zg :== TTTQ{Za^) 
where ttq : T*Q ^ Q is the cotangent bundle projection, and then set 



(A.9) 



where 3 : T* Q ^ g* is the momentum map corresponding to the G-action, and 3 is the curvature two- form 
of the connection A. 



Lemma A. 5. Define, for any z E P, 



where (hl^ )* : T* P ^ T^^^{P/G) is the adjoint map o/hl^ . Then, f is G-invariant, i.e., f°'^g = / for any 
geG. 

Remark A. 6. The map fz\Dp{z)^ i-^-, fz defined above restricted to Dp{z) C TzP ®T*P, is the backward 
Dirac map (see Bursztyn and Radko [5]) of 



hi 



Tz^:TzT*Q^TzP, 



that is, fz = 23(/)2 using the notation in [5]; hence the image f{Dp) C TT*Q © T*T*Q is a Dirac structure. 
Proof of Lemma \A.5\ Let {vz,az) € TzP ®T*P and (vgz,dLgz) '.— ^g(uz,Q!2) for g e G, i.e., 



'^gz 



Using the identities p = f ^ ° ttq (see diagram (A. 5 )) and ttq o $^ ~ ttq, we have 

TgzP{Vgz) = TgzpoTz^^iVz) 

= T[z]ip-'^TgzTr^oTz^P{vz) 
= T[z]V'^oTz{7r^o<i>P){v,) 

= T[z]^^^ °TzT:g{vz) 
= Tzp{vz). 

On the other hand, for any W[z\ € T[z]{PlG), 

(hlfj*(cig,) - (hi;fj*oT;,$;_,(a.) 



(r3,<i>f_,ohlf,)*(a,) 
(hlf)*(a.), 



because of the invariance property of the horizontal lift hi , i.e., Tgz^^^i ° hl^^ = hl^ . Hence it follows that 

fgz O 'i^g{Vz,0!z) = f gz{v gz, Olgz) = fz{Vz,az). □ 



Proof of Theorem \A.4\ Lemma A. 5 implies that the map flop defined above induces the following well- 
defined map: 



/ : [Dp]g TT*Q®T*T* 
i.e., the diagram below commutes. 



[{vz, az)]G ^ (Tzp{vz),T;^o (hlf )*(«,) 




Let us look into the image D :— /([_Dp]g). Notice first that 

Tzp{:Kz) = T[z]ip~'°Tz7:^{^z) = TzT* 
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since TzTTqCKz) — T[^]{P/G) and T[z](p ^ is surjective. 

On the other hand, notice that '■— hl^ o Tg(p{wz) is in Oi^ for any Wg G TgT*Q, whereas — n^p{vz) S 
5£°. So we have 

0= (a.-f^^K),hlf orj</,K-)^ 
= (r| ^ o (hlf ) * a, - r| ^ o (hl^ ) * (t;, ) , zi;^ ) . 

Therefore, 

r;^ o (hif )*a, = t;^o (hi;^)*r!^(«,). 

However, for an arbitrary Wg G TzT*Q, 

^T|(/?o(hl^)*17p(w^),W2^ =f2p(^W2,hl^ or2.^(w2)^ 

= fl:K(vz,h\f oTgip{wg)j 



where the second hne fohows from the definition of lljc, Eq. (A. 4), since (vz^az) G Dp{z) imphes Vz £ "Kz] 
the third hne foUows from p*d'^^\^ = Vt'ji (see [Ul Proposition 2.2]); the fifth from diagram (A. 6 1. As a 
result, we have 

and thus 

f{[{vz,az)]G) = f{vz.az) = {Tzp{vz),{n-''i'oTzp{vz)) . 

Since Tzpi'Kz) = TzT*Q, the image D = /([Z^pIg) = f{Dp) is given by Eq. ( [Xs] ). □ 

A. 3. Reduction of Weakly Degenerate Chaplygin Systems. Reduced dynamics of the constrained 
impUcit Hamiltonian system, Eq. (A. 3 1, for weakly Chaplygin systems follows easily from Theorem A. 4 For 
weakly Chaplygin systems, it is straightforward to show that the constrained Hamiltonian Hp is related to 

(A.IO) 



the reduced Hamiltonian defined in Eq. (5.2) as follows: 

H ^ Hp ohl^, Hp^Hop^ 
and also that if {Xp, dHp) E Dp then, defining X(z) := Tzp ■ Xp{z), we have 

f{Xpiz),dHp{z)) ^ {X{z),dH{z)) , 
because, using hl^ oTgip = (T^pljtJ^^ (see diagram ( |A.6P ) and Eq. ( |A.10[ ), for any Vz e TzT*Q, 

T*,ipo{h\fydHp{z),Vz) = (dHp{z)Mf °Tzv{v-z 



= {dHp{z),{TzpWX\vz)) 
= {p*dH{z),{TzpW^-\v,)) 

= {dH{-z),Tzpo{TzpWX\y-z)) 
= {dH{-z),Vz). 



Therefore, the constrained implicit Hamiltonian system, Eq. (A.3), reduces to 

{X,dH) e D, 

or 



iv^rj"'' = dH. 
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Remark A. 7. Again, this result is essentially a restatement of the nonholonomic reduction of Koiller [32] 
(sec also Bates and Sniatycki Cantrijn et al. [7], and Hochgerner and Garci'a-Naranjo [57]) in the language 
of Dirac structures and implicit Hamiltonian systems. 
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